Subband-based Speech Recognition

ثبت نشده
چکیده

In the framework of Hidden Markov Models (HMM) or hybrid HMM/Artiicial Neural Network (ANN) systems, we present a new approach t o wards automatic speech recognition (ASR). The general idea is to divide up the full frequency band (represent e d i n t e r m s o f critical bands) into several subbands, compute phone probabilities for each sub-band on the basis of subband acoustic features, perform dynamic programming independently for each band, and merge the subband recognizers (recombining the respective, possibly weighted, scores) at some segmental level corresponding to temporal anchor points. The results presented in this paper connrm some preliminary tests reported earlier. On both isolated word and continuous speech tasks, it is indeed shown that even using quite simple recombination strategies, this subband ASR approach can yield at least comparable performance on clean speech while providing better robustness in the case of narrowband noise.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Subband temporal modulation spectrum normalization for automatic speech recognition in reverberant environments

Speech recognition in reverberant environments is still a challenge problem. In this paper, we first investigated the reverberation effect on subband temporal envelopes by using the modulation transfer function (MTF). Based on the investigation, we proposed an algorithm which normalizes the subband temporal modulation spectrum (TMS) to reduce the diffusion effect of the reverberation. During th...

متن کامل

A High-Dimensional Subband Speech Representation and SVM Framework for Robust Speech Recognition

This work proposes a novel support vector machine (SVM) based robust automatic speech recognition (ASR) frontend that operates on an ensemble of the subband components of high-dimensional acoustic waveforms. The key issues of selecting the appropriate SVM kernels for classification in frequency subbands and the combination of individual subband classifiers using ensemble methods are addressed. ...

متن کامل

Subband{based Speech Recognition in Noisy Conditions: the Full Combination Approach Subband{based Speech Recognition in Noisy Conditions: the Full Combination Approach

In this report we investigate and compare di erent subband based Automatic Speech Recognition ASR approaches including an original approach referred to as the full combination approach based on an estimate of the noise weighted sum of posterior probabilities for all possible subband combinations We show that the proposed estimate is a good approximation of the ideal but often unpractical soluti...

متن کامل

Subband parameter optimization of microphone arrays for speech recognition in reverberant environments

In this paper, we present a new subband microphone array processing algorithm specifically designed for speech recognition applications. We previously proposed a speech recognizer-based array processing algorithm which resulted in significant improvements in recognition accuracy when the speech was corrupted by additive noise and moderate levels of reverberation. However, little improvement was...

متن کامل

Robust parameters for speech recognition based on subband spectral centroid histograms

In this paper we propose a new speech parameterization framework that efficiently combines frequency and magnitude information from the short-term power spectrum of speech. This is achieved through computation of subband spectral centroid histograms (SSCH). Relationship between the proposed method and auditory based speech parameterization methods is discussed. An experimental study on an autom...

متن کامل

Recognizing cochlear implant-like spectrally reduced speech with HMM-based ASR: experiments with MFCCs and PLP coefficients

In this paper, we investigate the recognition of cochlear implantlike spectrally reduced speech (SRS) using conventional speech features (MFCCs and PLP coefficients) and HMM-based ASR. The SRS was synthesized from subband temporal envelopes extracted from original clean speech for testing, whereas the acoustic models were trained on a different set of original clean speech signals of the same s...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1997